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Abstract 

Existing computer algebra packages do not fully support quantum mechanics 
calculations in Dirac's notation. I present the foundation for building such support: 
a mathematical system for the symbolic manipulation of expressions used in the 
invariant formalism of quantum mechanics. I first describe the essential mathemat- 
ical features of the Hilbert-space invariant formalism. This is followed by a formal 
characterisation of all possible algebraic expressions in this formalism. This char- 
acterisation is provided in the form of a set of terms. Rewrite rules over this set of 
terms are then developed that correspond to allowed manipulations of the algebraic 
expressions. This approach is contrasted with cunent attempts to build invariant 
quantum mechanics calculations into computer algebra systems. 
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1 Introduction 

The mathematical stage on which non-relativistic quantum mechanics is usually set is 
a Hilbert space. The state of a system is represented by a vector in this Hilbert spac^ll 
physical observables are represented by linear self-adjoint operators, and the time evo- 
lution is dictated, through Schrodinger's equation, by the Hamiltonian operator 

It is often useful to pick a particular concrete example of the Hilbert space in which 
to solve problems. For instance, introductory quantum mechanics problems are usually 
solved explicitly in the space _L-^(K^), the space of square-integrable complex-valued 
functions — so-called "wavefunctions". Problems in finite Hilbert spaces are often 
solved in the concrete space C^, the space of column vectors of complex numbers — 
consider, for instance, problems featuring the familiar Pauli spin matrices. 

It is well-known, though, that a more abstract approach to quantum mechanics can 
be very fruitful 1 1]. Advanced calculations are often caiTied out without any reference 
to wavefunctions, or spin-matrices. It is not always necessary to specify a particular 
concrete Hilbert space, but rather the calculation proceeds by denoting vectors in the 
abstract space and working with them directly. I will refer to this abstract approach. 



' Or a density operator. The framework described here is suitable for describing any operator, including 
density operators. I will not explicitly consider the density operator in this paper. 
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which does not make essential reference to particular vector spaces, as the invariant 
formalism. These calculations are often carried out in a notation invented by Dirac 
which, as we shall see later, has had a significant impact on efforts to automate them. 

Computer techniques have found many applications in quantum mechanics prob- 
lems |2 1. There exist a number of very sophisticated computer algebra packages IJlHl. 
Almost any integral, sum, matrix calculation, or set of differential equations can be 
entered into one of these packages and automatically be solved or simplified. Numeri- 
cal integration of differential equations permits the solution of problems depending on 
continuous parameters, including time-evolution problems. Matrix techniques can be 
used to solve problems with finite state-spaces fS]. Computer algebra techniques have 
been used to automate the derivation of otherwise prohibitively complex expressions 
0. 

However, despite the above developments, support for calculations in the invariant 
formalism is notably missing. A number of systems have been built that attempt to 
support these calculations IS |4] |7] [H |9l . But none of these correctly model all possi- 
ble calculations in the invariant formalism, especially when tensor-product spaces are 
introduced. I will argue that this is a result of trying to implement Dirac's notation 
for the invariant formalism directly, rather than modelling the underlying mathematical 
structure. 

In the sequel I will first consider in more detail the problems with basing a sys- 
tem for invariant formalism calculation directly on Dirac's notation. I will then present 
an alternative approach, as suggested above, founded on the underlying mathematical 
structure of quantum mechanics. I will provide a self-contained summary of this math- 
ematical structure. I will then go on to characterise all possible algebraic expressions 
in this structure with a set of terms. After this, I will develop a set of rewrite rules that 
correspond to legal manipulations of the algebraic expressions. After a short example 

1 will extend the system to Hilbert spaces with a tensor-product structure and present a 
fuller example. 

2 Dirac's notation 

Dirac developed a notation for calculations in the invariant formalism that is both pow- 
erful and easy to use 1 10|. His insight was to make the notation purposefully ambigu- 
ous. Most significantly, two types of symbol are introduced, bras and kets, that can be 
used to represent a variety of different mathematical objects. Alone these kets and bras 
represent state vectors and their duals respectively. In combination they can represent 
inner products, operators, and tensor products. The power of the notation lies in the 
fact that the syntactic ambiguity reflects, at least in most cases, some kind of equality 
between the denoted mathematical objects. Roughly speaking, if a combination of bras 
and kets 'looks like' it could represent one of the above objects, it probably can. This 
flexibility and ambiguity makes the notation very comfortable to work with in a prac- 
tical sense, reducing the number of tedious, mechanical steps in a calculation. Dirac's 
notation has deservedly found widespread acceptance. 

Consider the following example, the application of a projection operator to a state 
in the Hilbert space H, 

m = l«> • 

This can be written explicitly in Dirac's notation as 

|*) = (|0)(0|)|a) , (1) 
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where I have bracketed the projection operator to emphasise that this expression rep- 
resents an operator that maps H H, acting on an element of H. However, this 
expression can also be interpreted as 

\^) = Ma)) , (2) 

where the expression is now to be read as an element of H multiplied by a complex 
number, the inner-product of two vectors. The beauty of Dirac's notation is that both of 
the above expressions are extensionally equal. Conventionally, the expression would 
not be bracketed, giving the reader the flexibility to read the expression both ways ([T] 
and|2]). 

The scalar multiplication in ^ is commutative and so the expression can be rewrit- 
ten as 

l*> = (0|a> \^) , 
It is, however, not acceptable to rearrange equation[T]to give 

Although any practising quantum-physicist would immediately recognise the above 
manipulation as invalid, most would be unable to easily provide a formal specification 
of which manipulations are allowed and which are not. Working in Dirac's notation 
is a skill that is learnt and the rules are complex. In the author's experience, many 
physics students struggle to understand the subtleties of manipulation of complex Dirac 
expressions, particularly where tensor product spaces and operators are involved. 

Formally, Dirac's notation represents five separate mathematical operations with 
the same syntactic form: juxtaposition. The operations of scalar multiplication, taking 
the inner product, operator application, operator composition, and forming the tensor 
producj^ are all represented by butting bras and kets up against one another The mean- 
ing of a bra or a ket in a Dirac expression depends heavily on its context, and often 
there can be more than one context that is applicable for a given expression. Mechani- 
cal rules for re-arranging the symbols are thus difficult to provide because whether it is 
admissible to manipulate a symbol depends on the symbol's context. To successfully 
work in Dirac's notation one must understand the meaning of the expressions, as it is 
these meanings that provide the context. It is this point that makes a computer imple- 
mentation of Dirac's notation very difficult. Computer algebra systems have no notion 
of meaning — they are automata that perform mechanistic structural re-arrangement 
of expressions. 

Previous software packages have typically tried to capture the intricacies of Dirac's 
notation by treating expressions as sequences of bras and kets glued together by gener- 
alised 'multiplication' operators. Some control is usually provided over the commuta- 
tivity properties of the bras and kets with respect to these generalised multiplications, 
with varying degrees of sophistication. These commutativity properties are then tuned 
to match the complex manipulation rules of Dirac's notation. This approach is suffi- 
cient to model simple manipulations of Dirac expressions. However, as these systems 
are not capturing the full context associated with these expressions, they are bound 
to be limited in their scope. To fully capture this notion of context I argue that one 
must model the full underlying mathematical structure. In this paper then, rather than 
attempt to implement a computer algebra system for Dirac's notation directly, I will de- 
scribe a computer algebra system for the underlying mathematical structure. If desired, 

^Sometimes the tensor product is explicitly indicated in particularly ambiguous, or didactic, contexts. 



3 



it is reasonably straightforward to add Dirac's notation as an ambiguous input/output 
representation. 

3 Quantum algebra 

In this section I describe the mathematical formalism behind (non-relativistic) quantum 
mechanics. The treatment will be brief and serves to provide a self-contained definition 
to refer to. The reader is directed towards [1] for more details. In the rest of the 
paper I will loosely describe this mathematical structure as the "quantum algebra". 
For simplicity of presentation I shall ignore the tensor product for the moment. I will 
remove this simplification in section|2l 

The fundamental elements of the algebra are (complex) Hilbert space vectors. I 
will denote vectors by an overhead arrow e.g. tp . Complex linear combinations of 
vectors are also vectors 

(j) = 3x + 4:1^ , 

where, as usual, the addition operation is associative and commutative. 

The space is equipped with an inner product, which is a sesquilinear operator that 
maps two vectors into a complex number The inner product is denoted, following the 
usual mathematicians' convention, as a round-bracketed pair of vectors, separated by a 
comma. Sesquilinearity means the following properties hold 

i$,a^+bx) = a{$J) + b{(f,x) , (3) 
ia$+b^,x)^a*{$,x)+b*ii^,x), (4) 

where I have followed the physicists' convention of making the product conjugate- 
linear in its first argument. 

Operators are maps from a Hilbert space into itself They will be denoted by 
overhead hats, as usual. Operators are linear maps, in the sense that 

d{a$+biP) ^ ad{(t^) + bd{iJ) . (5) 

Operators are also elements of a complex vector space, so complex linear combinations 
of operators are themselves operators, 

d = P + 3iQ . 

As with vectors, this addition operator is associative and commutative. Operator appli- 
cation is linear with respect to the vector space structure of operators, 

{ad + bP){$)^ad{$) + bP{$) . (6) 

Operators can be composed to form new operator^ I will represent composition by a 
dot, 

O^P-Q. 

The action of a composite operator on a state is defined by 

(P • Q)^ = P{QW) ■ (7) 

^Loosely speaking. There is some subtlety to do with domains that is rarely important. 
''This is sometimes misleadingly referred to as multiplication of operators. 
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The final objects to deal with are Dirac's bras. Dirac introduces the dual space to 
the Hilbert space of kets, the space of bras. This introduction is unnecessary and some 
authors have argued that it creates more problems than it solves li 1 1 il . I will take an 
alternative approach. It is sufficient to introduce a projection operator, parameterised 
by two vectors, with definition 

M[^Jm^{l6)^ . (8) 

It is straightforward to show that this does indeed satisfy the linearity requirement for 
an operator. This approach has the advantage that rules for operators will apply to the 
projector, minimising the amount of duplication. The operator M is equivalent to the 
Dirac construction 

M[^J] = \^){c^\ . 

All expressions featuring bras in Dirac's notation can either be rewritten using the 
above projection operators, or else their dual can|f| 

Bases are not a fundamental, nor necessary, ingredient of the algebra. They are 
however often essential to make progress in a calculation. We will consider bases 
again in section[8] 

4 Term structure 

I have described above the mathematical structure that underlies quantum mechanical 
calculations. Our aim in this paper is to develop a framework for automating calcula- 
tions in this algebra. The computer is a mechanistic device, and as such we need to 
provide it with a set of mechanical rules for valid, structural transformations of quan- 
tum algebra expressions. The first step in developing these rules is to characterise the 
structure of possible expressions in the quantum algebra. To effect this characterisation 
we need to consider a second, very different, mathematical construction: a set of terms. 

The notion of a term comes from the field of universal algebra 1 12|. Informally, a 
term looks like a nested set of function-calls i.e. f{x, g{y)) or hi). Formally, a set of 
terms is defined by a signature, S, and a set of variables, X. The signature is a set of 
function symbols (/, g, and h in the above examples) and their arities. The arity of a 
function symbol is a specification of how many, and what sort of, arguments it takes. 
The sort of the argument is required because there are distinct types of mathematical 
objects in our quantum algebra: vectors, operators and scalars. It does not, for example, 
make sense to take the inner product of a vector and an operator In the literature of 
term-rewriting and universal algebra a set of terms with sortec0 arities, is known as a 
many-sorted term system. The variables X are simply a set of symbols, distinct from 
the function symbols in the signature. They can be used to stand for any valid term 
in the definition of a rule (see section |5]l. Terms are defined inductively, as either a 
variable, or a function symbol applied to other valid terms of a suitable sort. 

To model the structure of expressions in the quantum algebra I introduce the fol- 
lowing signature. First, there are function symbols for combining scalars, with their 

'This is not strictly true, as one could imagine a space that is constructed as the tensor product of a ket- 
space and a bra-space \a) ® {b\. For all practical purposes, though, this could be replaced with the equivalent 
ket-ket space \a) \b), and the projection operators are once again sufficient. 

^This confusing use of the word sorted is, unfortunately, established in the literature. It should be read as 
"having a sort", not the more usual "put in order". 
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associated arities 



conjugate scalar — > scalar , 

plusS scalar x scalar scalar , 

timesS scalar x scalar — > scalar . 

where the arity of plusS, for instance, should be understood as indicating that the 
function symbol plusS requires two scalar arguments, and is itself of scalar sort. This 
is a minimal set of operations on scalars. Any realistic computer algebra system will 
have a much larger set of scalar representatives (integrals, sums, fractions and the like). 
As computer algebra techniques for scalars are well-established I will not consider 
them further, and keep only this minimal set of operations. 

Next, there are function symbols that represent linear combinations of vectors and 
operators 

plusV vector x vector — > vector , 

timesV scalar x vector — > vector , 

plusO operator x operator — > operator , 

timesO scalar x operator — > operator . 

These are followed by function symbols to represent the operations of taking the 
inner product, applying an operator, and composing two operators 

ip vector x vector — > scalar , 

apply operator x vector — > vector , 

compose operator x operator — > operator . 

Finally, I introduce a function symbol to represent the projection operators (equa- 
tionlSJ that will take the place of Dirac's kets 

projector vector x vector — > operator . 

In addition to the above function symbols I shall assume that there is a ready sup- 
ply of constant (unary) function symbols of all sorts for representing constant vectors, 
scalars, and operators. I will introduce variables in section|5] 

Let us consider an example. The Dirac expression 

(2p + 5g)(3|V') + (a|/3)|^)), 

is represented by the term 
apply( 

plusO( timesD(2,p), timesO(5,(7) ), 
plusV( timesV(3, -i/;), timesV( ip((3, 4>)) 



where 2,3,5,p,q,a.,P,(j), and ip are constant symbols of scalar, operator and vector 
sorts respectively. 

It might seem as though I've just invented a new and rather clunky notation for 
expressions in the quantum algebra. But the significance is deeper than that: the term 
specification above gives a formal definition of all possible expressions in the quantum 
algebra and what sort of expression they are. The beauty of the term representation is 
in its regularity and lack of ambiguity. 
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5 Rewrite rules 



In section [3] the mathematical structure of quantum mechanics calculations in the in- 
variant formalism was outlined. Then, in section IH a second mathematical structure 
was introduced, one that describes every possible expression that can be written in the 
algebra of section[3] In this section I will make a link between these two structures by 
describing which term-expressions correspond to things that, in the quantum algebra, 
have the same value. This is the essence of algebraic manipulation: changing the form 
of an expression without changing its value. 

The mathematical tool that I will use to make the link is what is known as a term- 
rewriting system (TRS). A TRS is a set of terms, as defined above, and a set of rules 
that transform one term into another A rule is applied by taking a term, or part of a 
term, that matches the left hand side of the rule and replacing it with the right hand 
side. Rules can contain variables which can represent any term of the appropriate sort. 
Readers seeking formal definitions of a rule, matching, and rule application will find 
them in ifTZI . 

The rules for the quantum algebra are as follows. The rules are written with left- 
right arrows to indicate that, as described below, they can be used in both directions. I 
will assume that suitable rules for the pure scalar operations, conjugate, plusS, and 
timesS, will be built in to any computer algebra system and will not present them here. 



The first three rules rewrite linear combinations of vectors 

timesV( a, plusV(wi, 112) ) plusV( timesV(a, wi), tiinesV(a, ^2) ) , (9) 

timesV( plusS(ai, 02), Wi ) ^ plusV( timesV(ai, w), timesV(a2, w) ) , (10) 

tiinesV( ai, timesV(a2, w) ) ^ timesV( timesS(ai, 02), v). (11) 



where here, and in what follows, symbols beginning a, v and o are variables (members 
of X) that can stand for any term of scalar, vector, and operator sort respectively. 
The next three rules are the equivalent of the above for operators 

timesO( a, plusO(oi, 02) ) <-> plusO( timesO(a, oi), timesO(a, 02) ) , (12) 
timesO( plusS(ai, a2), o ) <-> plusO( timesO(ai, o), timesD(a2, o) ) , (13) 



timesQ( ai, timesD(a2, o) ) <-> timesQ( timesS(ai, a2), o ) . (14) 

The sesquilinearity of the inner product ^ is captured with the next set of rules 

ip( vi, plusV(w2, U3) ) ^ plusS( ip('i;i, U2), ip(wi, W3 ) ) , (15) 

ip( plusV(i>i, U2), U3 ) ^ plusS( ip(wi, U3), ip(w2, V3) ) , (16) 

ip( wi, timesV(a, V2) ) ^ timesS( a, ip(wi, ^2) ) , (17) 

ip( timesV(a, ui), V2 ) ^ timesS( conjugate(a), ip(wi,W2) ) . (18) 

The following set of rules deal with the biUnearity of operator appUcation (|5]and|6ll 
and the action of composite operators ^ 

apply( o, plusV(ui,t;2) ) ^ plusV( apply(o, -wi), apply(o, ■i;2) ) , (19) 

apply( o, timesV(a, ti) ) ^ timesV( a, apply(o, u) ) , (20) 

apply(plus0(oi,O2), w )<-> plusV( apply (oij-y), apply(o2, w) ) , (21) 

apply( timesO(a, o), w ) <-> timesV( a, apply(o, v) ) , (22) 

apply( compose(oi, 02), v ) ^ apply( oi, apply(o2, w) ) . (23) 
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The next rule describes the operation of the projection operators (O 

apply( projector(wi,'i;2), V3 ) ^ timesV( ip(u2,W3), fi ) ■ (24) 

Finally, the following rules capture the associative and commutative nature of vec- 
tor and operator addition 

plusV(ui, 172) <-> plusV(u2, t^i) , (25) 
plusV( vi, plusV(w2,W3) ) ^ plusV( p1usV(di,D2), V3 ) , (26) 

plusD(oi, 02) ^ plus0(o2, oi) , (27) 
plusO( oi, plus0(o2, 03) ) ^ plusD( plusO(oi, 02), 03 ) . (28) 

It is usual to avoid adding explicit rules such as these to capture commutativity and 
associativity, as they lend the TRS some undesirable properties. Rather, rule matching 
is usually extended to include matching modulo associativity or commutativity. For our 
purposes, though, this will add unnecessary complication and, for now, adding these 
associative and commutative rules will have no ill effect. 

The rules allow us to manipulate expressions without changing their value. Any 
expression can be transformed into any other with the same value by successive appli- 
cation of these rules, either in the forward of reverse direction. Formally, the transitive 
reflexive closure of these rules generates an equivalence relation on the set of terms. 
The equivalence classes of this relation are precisely the sets of terms that correspond 
to expressions in the quantum algebra that have the same value. 



6 Simple example 

Let us consider an example to demonstrate the action of the TRS. We will expand the 
application of a projection operator to a state 

To describe the application of rules from the TRS to parts of an expression we 
need to define position within a term. Each part of a term, apart from the term itself, 
is the argument to a function symbol. Thus, any part of a term can be specified by 
saying which argument of the enclosing function symbol it is, and which argument the 
enclosing function symbol is of its enclosing symbol, and so on, until the root of the 
term is reached. This list of integers is the position. Conventionally the list is ordered 
to start from the root. The term itself is given the position e. 

Table [T] shows step-by-step how the term evolves. First the scalar pre-factor is 
brought out of the operator application. The second step uses the linearity of the op- 
erator to split the expression into a sum of two vectors. After removing another scalar 
factor from an operator application in the third step, the projection operators are ap- 
plied. Finally, the expression is rearranged into a conventional form by manipulating 
the scalar mulitplying the vector Note that if the Dirac-form expressions were writ- 
ten conventionally, without the extra bracketing, a number of expressions at successive 
steps would be identical. 

This example may seem facile, but it is not. The important point is that at each step 
the expression had an unambiguous representation and each step involved applying 
a particular rule to a well-defined position. It is this rigorously defined mechanical 
precision that is essential to a well-functioning computer algebra system. 
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Term 



Dirac-form 



Rule Position 



apply( projector( a, a ), timesV( plusV( /3, timesV( —1, 7 ) ) ) ) 
timesV( apply( projector( a, a ), plusV( /3, timesV( —1, 7 ) ) ) ) 

timesV( plusV( apply( projector( a, a), (3 ), apply( projector( a, a ), timesV( —1, 7 ) ) ) ) 
timesV( plusV( apply( projector( a, a), (3 ), timesV( —1, apply( projector( a, a), 7 ) ) ) ) 
timesV( plusV( timesV( ip( a, (3 ), a ), timesV( —1, apply( projector( a, a), 7 ) ) ) ) 
timesV( plusV( timesV( ip( a, /3), a ), timesV( —1, timesV( ip( a, 7 ), a) ) ) ) 
timesV( plusV( timesV( ip( a, /3 ), a ), timesV( timesS( —1, ip( a, 7 ) ), c? ) ) ) 
timesV( timesV( plusS( ip( a, (3 ), timesS( —1, ip( a, 7 ) ) ), a)) 

timesV( timesS( plusS( ip( a, (3 ), timesS( —1, ip( a, 7 ) ) ) ), a) 



l«> {a\{^^m + {-\l)))) 
-Ijla) (a|(|/3)+(-|7))) 
^(|a) H(|/3)) + |a> H(-|7») 
^(|a) (a|(|/3))-|a) (a|(|7))) 
^((a,/3)|a)-|a) H(|7))) 
^((a,/3)|a)-(a,7)|a)) 
^((a,/3)|a) + (- (a,7»N) 
^(((a,/3)-(a,7))|a» 
(-i^((a,/3)-(a,7)))|a) 



20] 

m 
20] 

24] 
24] 

m 

TO] 
IT] 



2 

2,2 

2,1 

2,2,2 

2,2 

2 



Table 1 : Application of a projection operator The first column shows the term as successive rewrite rules are applied to it. The second column shows 
the equivalent Dirac-form. More bracketing than would be usual has been used to better reflect the structure of the term. The third column shows the 
rule that has been applied and the direction in which it was used. The final column shows the position at which the rule was applied. 



7 Extension to tensor product spaces 



Thus far we have only dealt with the quantum mechanics of isolated, single systems. 
To accommodate the description of multipartite systems we need to remove the simpli- 
fication made in section |3] and introduce the tensor product. 

If the states of two systems are represented by vectors in the Hilbert spaces Hi and 
Ti.2 respectiveljQ, then states of the composite system are represented by vectors in the 
tensor product space Hi ^ H2- It is this tensor product structure that is responsible 
for many interesting effects in quantum mechanics. In this section I consider how to 
extend the term-rewriting system to tensor product spaces. 



7.1 Quantum algebra 

The quantum algebra is extended as follows. The states will be labelled with a subscript 
that indicates the space e.g. -01 is a vector in space Hi- 

The tensor product of two vectors will be indicated in the usual way, 

V'l (8) 02 ■ 

The tensor product operator is associative and commutative. The product is linear in 
its two arguments 

01 (E) (a 05*2 + b02) = a(0i (g) 0^^) + 6(0i » 62) , 
(a 01 + 60i) (K)^2 = a{$i (g 62) + b{ipi (g) 62) . 

It is necessary to extend the definition of the inner-product over tensor products of 
vectors, 

(01®02,^l®/32) = (0^1,e"l)(02,/32) . (30) 

Note that this inner product is only defined if the two vectors belong to the same space. 

Operators are also labelled with a subscript to indicate the space they act on. Tensor 
products of operators can be defined, and as with vectors, the operator tensor product 
is associative, commutative, and linear in its arguments 

Pi <g{aq2 + bh) = cl{pi (g) 52) + b{pi (X) h) , ^^^^ 
{api + b qi) ® 7-2 ^ a{pi ® 7^2) + b{qi (g 7^2) ■ 

The action of a tensor product operator on a tensor product of vectors is defined by, 

Ol®P2(01®02) = Oi(0i)®P2(02). (32) 

As with vectors, a subscript on an operator indicates which space it acts or0. 



7.2 Terms 

Vectors, and operators can now belong to different Hilbert spaces. We embed this 
information in the term structure by parameterising the sorts. We will represent vectors 

'Often these spaces will be, in a formal sense, the same space — consider, for instance, two coupled 
qubits. We will nonetheless treat these spaces as distinct as they pertain to different physical systems. 

^'Note that only tensor product operators can act on tensor product states. Often when an operator in one 
space is tensored with the identity operator in another, the identity is omitted. That is not admissable here. 
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and operators associated with the space Hi ^ Ti.2 <Xi • • • by terms with sorts 

vectorpii (8) ® • ■ ■ ] , 
operatorpii 7^2 ® • ■ ■ ] ■ 

The term structure of section |4] is extended in the obvious way using the new param- 
eterised sorts. Two additional function symbols are needed to express tensor products 
of vectors and operators respectively 

tensorV vector[Hi] x vector[H2] vector[Hi (^0.2] , 

tensorO operator[T-li] x operatorpi.2] operator\Hi ®TL2\ ■ 

The use of the tensor product notation inside the parameterised sort is more than just a 
notational convenience. It is meant to be taken as implying that there are isomorphisms 
from the semigroup of Hilbert spaces to the sorts of vectors and operators, with respect 
to the tensor product operation. For example, consider two tensor product expressions 
of vectors from four vector spaces 



I have introduced the operator sort which reveals the sort of a term. The isomorphism 
requires that the structure of the tensor product operation is respected by the sort struc- 
ture, meaning that in this case the following must hold 



7.3 Rewrite rules 

We need to extend the rewrite rules to work with tensor products. The first set of 
additional rules capture the linearity of the tensor product operation with respect to 
vectors ( |29] ) 

teiisorV(i;i, plusV(w2, fs)) ^ plusV(tensorV(tii, W2), teiisorV(tii, W3)) , (33) 

terLSorV(plusV(t;i, W2), W3) ^ plusV(tensorV(tii, W3), teiisorV(z;2, W3)) , (34) 

tensorV(timesV(a, wi), f2) ^ timesV(a, tensorV(tii, W2)) , (35) 

tensorV(i;i, timesV(a; W2)) ^ timesV(a, teiisorV(tii, U2)) . (36) 

An equivalent set of rules are needed for tensor products of operators ( |3T1 ) 

teiisorO(oi, plus0(o2, 03)) ^ plusO(tensorO(oi, 02), teiisorO(oi, 03)) , (37) 

tensorO(plusO(oi, 02), 03) ^ plusO(tensorD(oi, 03), teiisor0(o2, 03)) , (38) 

tensorO(timesO(a, oi), 02) ^ timesO(a, tensorD(oi, 02)) , (39) 

tensorO(oi, timesD(a, 02)) ^ timesO(a, tensorO(oi, 02)) . (40) 



tensorV(tii, 112) , and 
teiisorV(ti3, V4) 



where. 



sort('yi) 
sort(i;2) 
sort(w3) 
sort(w4) 



vector[Hi ® H2] , 
vectorp-ls] , 
vector[Hi] , 
vector[H2 ® H3] . 



sort(tensorV(wi, i;2)) = sort(tensorV(w3, D4)) . 
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Once again, explicit rules are added to represent the associativity and commutativity of 
the vector and operator tensor products 

tensorV(t;i, W2) <-> tensorV(u2, 1'l) , (41) 

tensorV( wi, tensorV(u2, ^3) ) ^ tensorV( tensorV(ui, 7J2), ^3 ) , (42) 

tensorD(oi, 02) ^ tensor0(o2, oi) , (43) 

tensorD( oi, tensor0(o2, 03) ) tensorO( tensorO(oi, 02), 03 ) . (44) 

Inner products of tensor products ( l30l l are defined by the rule 

ip(tensorV(wi, U2), ■tensorV(u3, W4)) ^ timesS(ip(ui, 173), ip(w2, ^^4)) ■ (45) 

And, finally, the operation of tensor product operators on tensor product states i 

apply(tensorO(oi, 02), tensorV(vi, V2)) <-> tensorV(apply(oi, vi), apply(o2, V2)) 

(46) 

8 Example: quantum teleportation 

Quantum teleportation is a well-known illustration of the behaviour of entangled sys- 
tems IIT3I . Alice and Bob share a pair of qubits that they have previously entangled into 
a Bell-state 

|0) = -i=(|O)jO), + |l)Jl),). (47) 

where a is the Hilbert space of Alice's qubit and b is the space of Bob's qubit. All of 
the states are written in the so-called computational basis. Alice wishes to transfer the 
state of a second qubit, in space 02, to Bob 

|</.)=a|0)„^+/3|l)„^ . (48) 

The state of the whole system is the tensor product of the entangled qubits and Alice's 
private qubit 

1^) = 1^) ^ \e) = i=(a |0),^ + P |1),J(|0)„ |0), + . (49) 

Ahces carries out the teleportation by first applying a CNOT gate to her two qubits, 
using 02 as the control, followed by a Hadamard gate to her second qubit. It is straight- 
forward to show that the resultant state of the system is 

1^)' = \i |0)., |0)„ {a |0), + /3 |1) J + |0),^ {a\l), + /3 |0) J + ^^^^ 
|l>a. |0). (« |0), - /3 |1) J + (a - 13 |0) J ) . 

Alice now measures the state of her qubits in the computational basis. It is clear that 
if she measures both qubits into state |0) then she has successfully teleported her qubit 
state to Bob. The other possible outcomes for Alice's measurement are straightfor- 
wardly dealt with. 

This computation can be carried out with the TRS. We start by representing the 
starting state ( |49] | as a term expression. To make the presentation more readable I will 
name parts of the term expression with := . The shared Bell state is ( |47] ) 

6 := timesV( —=, plusV( teiisorV( Oq, Qt ), tensorV( la, 1^ ) ) ) , 
v2 
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and Alice's state to teleport is ( |48] ) 



(j) := plusV( timesV( a, Oaa ), timesV( /3, ) ) • 

The starting state is the tensor product of these states 

:= tensorV( cf), 6 ) . 

We need to define the action of the CNOT and Hadamard operators within the 
TRS. To do this we provide additional rules - effectively teaching the TRS about the 
new operators. The Hadamard operator is defined by the rules 

apply( K, Os) timesV( plusV(0,, 1,) ) , (51) 

apply( hs, Is) timesV( plusV( 0^, timesV(-l, Is))) , (52) 

for any qubit space ,s. Notice that the operator is specified with respect to particular 
states (constant symbols, not variables) in a particular basis. In this problem we shall 
always work in the computational basis, but in more complex problems it is often 
desirable to work in several bases. In this case it would be necessary to adorn the 
vector symbols and rules with basis information, as well as providing rules for changing 
between bases. 

The CNOT operator acts on a pair of qubits, in spaces si and §2, and has sort 
operator[si (g) §2]. It is defined by the rules 

apply( Csi,s2i tensorV(0si,0s2) ) tensorV(()s^ , O^J , (53) 

apply( Csi,s2, tensorV(05i , Is J ) ^ tensorV(Osj , 1^ J , (54) 

apply( Csi,s2, tensorV(l5i,0sJ ) -> tensorV(lsj , l^J , (55) 

apply( Csi^s2, tensorV(l5i, IsJ ) ^ tensorV(lsj , O^a) . (56) 

To construct the operators that act on the three-qubit system a third operator is 
needed - the identity operator - defined by the trivial rule 

apply( ids, Vs) ^ Vs 

for any space s and any vector Vs- 

Using these operators we can write the teleported state as 

apply( compose( teiisorD( tensorO( /iq^, ida, idj, ) ), tensorD( Ca2,a, idb )), ip) ■ 

Our task is to rewrite this expression using the rules defined above into the form of ( fSOl l. 
The details of this manipulation will not be presented as it involves 123 successive 
applications of the rules. In broad outline: the tensor product state is first expanded 
to a sum of tensor product basis states; the operator composition is then expanded and 
the CNOT operator applied; the Hadamard operator is applied and the resulting state 
is further expanded; finally, the terms are rearranged and collected to yield the result. 
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The resulting term expression is 
, 1 

timesV — , 
plusV( plusV( 

tensorV( teiisorV( 1^2, Oa ), plusV( timesV( a, Of, ), timesV(— /3, If, ) ) ), 
plusV( 

teiisorV( teiisorV( Oaa, la ), plusV( timesV( a, If, ), timesV( (3, Ob) ) ), 
tensorV( teiisorV( laj, la ), plusV( timesV( a, If, ), timesV( — Of, ) ) ) ) ), 
tensorV( teiisorV( Oaj, Oa ), plusV( timesV( a, Of, ), timesV( /3, 1 f, ) ) ) ) ) . 

which can be seen to represent the Dirac-form expression ( ISOt . It is perhaps surprising 
that a seemingly simple manipulation requires so many rule applications. Of the 123 
applications 24 are of associativity and commutativity rules and could be eliminated 
by using a more sophisticated matching system. However, 99 rule applications still 
remain, the majority of which are of rules expanding tensor products of vector sums. It 
is testament to the power of mathematical notation in general, and Dirac's notation in 
particular, that so many formal operations can be performed so quickly by hand. 

9 Implementation 

It is desirable to implement the above described TRS in a mainstream computer al- 
gebra package. Calculations might then benefit from integration with the numerical, 
(scalar) analytic and visualization facilities of these packages. Most computer algebra 
packages offer some mechanism for implementing rewrite systems. However, as far 
as the author is aware, no general-purpose, mainstream package provides support for 
the algebraically parameterised sorts used in this paper This section briefly describes 
a solution to this problem. Other details of the implementation are largely obvioufl 

A proof of principle implementation has been developed in the Mathematica [3] 
computer algebra package. Mathematica has a sophisticated conditional rewriting en- 
gine, but offers no support for sorts - all term expressions in Mathematica are of the 
same sort. It is straightforward, though, to add functions which reveal the sort of a 
term in the quantum algebra TRS. For convenience the problem is split into two parts. 
First, predicates are provided which indicate whether a term is a vector, operator, or 
scalar: vectorQ , operatorQ , and scalarQ. The inductively defined term struc- 
ture given in section |4] naturally suggests a recursive implementation. For example, 

'a proof-of-principle implementation may be obtained from the author's website ll4l . 
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vectorQ is defined as 

vectorQ = MatchQ[#, 
vector [ ] [ ] 

I (plusV[ vl_?vectorQ, v2_?vectorQ ] 

/; ( spaceOf[vl] == spaceOf[v2] )) 

I timesV[ _?scalarQ, _?vectorQ ] 

I (apply [ o_?operatorQ, v_?vectorQ ] 

/; ( spaceOf[o] == spaceOf [v] )) 

I tensorV[ _?vectorQ, _?vectorQ ] 

] &; 



Note that vector and operator constants are indicated by expressions that have head 
vector and operator respectively. The first curried argument of a vector or oper- 
ator constant is used to indicate the space to which it belongs. The second argument 
(or argument list) names the constant, and can carry ancillary information, for instance 
about the basis to which the name refers. 

Second, a function is provided that returns the Hilbert space to which a vector 
or operator belongs. For brevity, only a few clauses of the function are presented to 
indicate the general idea 

spaceOf [vector [a_] [ ]] := a 

spaceOf [timesV [ s_?scalarQ, v_?vectorQ] ] := spaceOf [v] 
spaceOf [apply [o_?operatorQ, v_?vectorQ] ] := sameSpace [o, v] 
spaceOf [tensorV [vl_?vectorQ, v2_?vectorQ] ] := 

tensorSpace [spaceOf [vl] , spaceOf [v2] ] 

The function sameSpace returns the space its two arguments belong to, if they are 
the same, otherwise generating an error. Of note is the function tensorSpace. It is 
this function that ensures that the sorts respect the semi-group structure of the Hilbert 
space tensor product. Its implementation is simple 

tensorSpace [ s 1_, s2_] := Sort [ Join [ si , s2 ] ] 

with the set union providing the required algebraic structure. Rules of the TRS are 
implemented as conditional rewrite rules, conditioned on appropriate results of these 
functions that reveal the sort of a term. 

This implementation is extremely inefficient and makes no attempt to integrate with 
the general-purpose algebraic functions of Mathematica. Furthermore, error checking 
is incomplete as a term's sort is only checked when a conditional rewrite rule is ap- 
plied. It is an open research question how to best develop a well-integrated, efficient 
implementation of the system in a general-purpose computer algebra package. 



10 Conclusion 

I have described the mathematical framework for building a computer algebra system 
for representation-invariant quantum mechanical calculations. This framework sig- 
nificantly differs from all previous work by capturing the underlying structure of the 
mathematics of quantum mechanics, rather than directly capturing the most conraionly 
used notation. I have demonstrated by example that the framework can be used to solve 
non- trivial problems. 
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A number of research directions are suggested by this work. Further formal charac- 
terisation of the TRS would be of interest; in particular determining whether the TRS 
has the desirable properties of termination and confluence. Integration of the system 
into a general-purpose computer algebra system also requires further work. The un- 
sorted rewrite-systems that most packages are built on present a particular problem for 
the implementation of complex algebraic systems of the type described herein. 

1 would like to thank Mike Tarbutt and Ben Sauer for their comments on this 
manuscript. The author is supported by an STFC Advanced Fellowship. 
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